First occurrence of a word among the elements of a finite dictionary in random sequences of letters
Abstract
In this paper we study a classical model concerning the occurrence of words in a random sequence of letters from an alphabet. The problem can be studied as a game among (m + 1) words: the winning word is the one that occurs first. We prove that knowledge of the first m words gives an advantage in constructing the last word, as has been shown in the literature for the cases m = 1 and m = 2 [CZ79, CZR09]. In fact, the last word can be constructed so that its probability of winning is strictly larger than 1/(m + 1), and we give an explicit lower bound for this probability. Our method is based on rather general probabilistic arguments that allow us to consider an arbitrary cardinality for the alphabet, an arbitrary value of m, and different mechanisms generating the random sequence of letters.
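The game described in the abstract can be explored numerically. The following is a minimal Monte Carlo sketch, not the paper's method: it assumes letters are drawn independently and uniformly (only one of the generating mechanisms the abstract allows) and that no word is a suffix of another, so ties cannot occur. The function names `first_word` and `win_frequencies` are hypothetical and chosen for illustration.

```python
import random


def first_word(words, alphabet, rng):
    """Draw i.i.d. uniform letters until one of `words` appears.

    Returns the index of the word that occurs first. Assumes no word is a
    suffix of another, so at most one word can complete at any step.
    """
    longest = max(len(w) for w in words)
    tail = ""
    while True:
        # Keep only as many trailing letters as the longest word needs.
        tail = (tail + rng.choice(alphabet))[-longest:]
        for i, w in enumerate(words):
            if tail.endswith(w):
                return i


def win_frequencies(words, alphabet="HT", trials=20000, seed=0):
    """Estimate each word's probability of occurring first."""
    rng = random.Random(seed)  # seeded so that runs are reproducible
    counts = [0] * len(words)
    for _ in range(trials):
        counts[first_word(words, alphabet, rng)] += 1
    return [c / trials for c in counts]


if __name__ == "__main__":
    # Case m = 1 on a fair coin: the second word is chosen knowing the first.
    print(win_frequencies(["HH", "TH"]))
```

For the pair HH and TH on a fair coin, HH wins only when the first two letters are both H, so TH wins with probability 3/4, which is strictly larger than 1/(m + 1) = 1/2. The estimate printed by the sketch should be close to that value.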
Similar resources
A Further Note on Runs in Independent Sequences
Given a sequence of letters generated independently from a finite alphabet, we consider the case when more than one, but not all, letters are generated with the highest probability. The length of the longest run of any of these letters is shown to be one greater than the length of the longest run in a particular state of an associated Markov chain. Using results of Foulser and Karlin (19...
Non-Abelian Sequenceable Groups Involving ?-Covers
A non-abelian finite group is called sequenceable if for some positive integer , is -generated ( ) and there exist integers such that every element of is a term of the -step generalized Fibonacci sequence , , , . A remarkable application of this definition may be found in the study of random covers in cryptography. The 2-step generalized sequences for the dihedral groups studi...
On Runs in Independent Sequences
Given an i.i.d. sequence of n letters from a finite alphabet, we consider the length of the longest run of any letter. In the equiprobable case, results for this run turn out to be closely related to the well-known results for the longest run of a given letter. For coin-tossing, tail probabilities are compared for both kinds of runs via Poisson approximation.
Search Space Reduction for Farsi Printed Subwords Recognition by Position of the Points and Signs
In the field of word recognition, three approaches are used: word isolation, overall shape, and a combination of the two. Most optical recognition methods recognize a word by breaking it into its letters and then recognizing them. This approach faces problems because of the difficulty of isolating letters and its low recognition accuracy in texts with low image quality. Therefo...
Using it Bundles in Published and Unpublished Writings
Lexical bundles are known as important elements of coherent discourse that have been the subject of much research. While the previous research has been mainly concerned with exploring variations in the use of these word sequences across different registers and disciplines, very few studies have addressed the use of some particular groups of lexical bundles within some types of academic writing....
Publication date: 2012